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ABSTRACT 

The new method of invariant definition of the measurable angle of light 
deflection in the static central symmetric gravitational field is suggested. 
The predicted pure gravitational contribution to the deflection angle slightly 
differs from its classical estimate and one may hope that this discrepancy 
could be experimentally detected in the near future. 

1. Introduction 

The phenomenon of light deflection in a weak static central symmetric gravitational 
field is one of the most simple effect predicted by the general relativity that can be really 
checked in experiment. The first calculation of the gravitational light deflection in frames 
of general relativity has been performed by A.Einstein (ref. [1], Eq. (74)), and the recent 
version of his formula more appropriate for description of experiment has been obtained 
in [2] and [3]. Up to now the theoretical prediction and experimental results have not 
revealed any noticeable disagreement. Probably this is a reason why the theoretical calcu- 
lations which are expounded in a number of textbooks and lead to the common wellknown 
result are hkely never be undergone sufficiently careful analysis concerning the connection 
between the geometrical tools of the theory and the results of the physical measurements. 
Otherwise it is difficult to explain why it has not been mentioned in the literature that the 
main formula determining the experimentally measurable value of the gravitational light 
defiection must be revised. Nowaday the accuracy of experiment [4] has almost achieved 
however the threshold (about 10~^ radians, see Sec. 3 and 5 below) of a detectability of 
the distinction between the classical prediction and more consistent one obtained in this 
paper (Eqs. (22) and (A22) below). 

Let us be more specific however. In the previous work by the author [5] the method 
of the invariant geometrical description of the deflection of light rays in the gravitational 
field of a static central symmetric massive body has been developed which, essentially, 
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operates with the quantities measurable in principle. To that end, it has been suggested to 
introduce the notion of the physical Riemannian space that, under the restrictions assumed, 
exhaustively characterizes the basic space relations available to observers and referring to 
the notion of a distance (including the angle between the space directions in particular). 
If one adopts such an approach then the sense of the light ray deflection becomes quite 
clear. Indeed, the shortest and at the same time straightest line in the physical space is its 
geodesic (the p-geodesic for brief) and the deflection of the light ray {i.e. the projection 
of the null space-time geodesic, the s-geodesic for brief, into the physical space) means 
precisely its deviation from the p-geodesic of the same initial direction. Such a deviation 
really occurs in a generic situation and one may suggest some quantitative characteristics 
describing it including, in particular, the total deflection angle. This approach seems to 
be the most natural global geometric interpretation of the effect of a distortion of a light 
beam by the static gravitational field. There are some reasons to name the corresponding 
deflection definition global (see also [5] for more details). 

In practice, however, the principle of the measurement of the light deflection angle 
is absolutely different as wefl as the implicit definition of the deflection angle itself. It 
is in no way surprising since the sufficiently long pieces of a light beam trajectories are 
not reafly available to observer and it is difficult 'to stretch' the p-geodesics in the space 
in order to compare them with the light beams trajectories. Besides, the observations of 
the disposition of the images of removed sources on the celestial sphere of an individual 
observer are used. The defiection angle is realized as the arc between the 'usual' (or 'true') 
position of the image of the removed source on the celestial sphere and its position when 
the fimb of the central massive body (the Sun as a rule) is close to it. Such a definition of 
the defiection angle can be named /oca/ in contrast to mentioned above since, in particular, 
the observer need not leave the local observatory where he observes the sky. 

The latter remark does not mean however that the concept of the physical space can- 
not be applied to the description of the light defiection measured by the local method and 
we shall consider here the corresponding theoretical background. Besides the 'pure gravi- 
tational' defiection, the infiuence of the orbital motion of the observer is taken into account 
in a uniform way and this generalization does not lead to any noticeable complications. 

We shall find that the classical formula determining the defiection angle which can 
be found in many wellknown textbooks (see, for example, [6-11] and also the discussion in 
the Section 5 below) should be modified even if there were no rotation of observer around 
the field center. 

It is perhaps worthwhile to emphasize here that the new result is derived just in the 
frames of the standard general relativity and is merely due to the more consistent and 
essentially invariant treatment of the connection between geometrical tools of the theory 
and the results of physical measurements. The magnitude of discrepancy between the 
classical and new predicted values is expected to be on a level of the current experiment 
uncertainty and thus one may hope that it could be experimentally distinguished in the 
near future. 

The parer is organized as follows. In the Section 2 we obtain the basic system of 
equations ensuring the rigorous definition of the locally measured light deflection angle. 
The appropriate Taylor expansion yielding the explicit formula is considered in the Section 
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3. It is fitted to the case of the Sun-Earth system in the Section 4. In the Section 5 we 
discuss the interpretations and the order of magnitudes of the separate terms in the result- 
ing formula. In the Appendix the possible alternative arrangement of the measurement of 
deflection angle is briefly analyzed. 

2. The physical metric and positions of light sources on the celestial sphere 

Let us consider a static spherically symmetric space-time whose metric, when re- 
stricted to the hyperplane 2 = 0, has the form 

ds^ = F{r)dt^ - G{r)dr^ - H{r)d(j)^ . (1) 

We drop the 2-dimension since it will be sufficient for us to consider the motions of observers 
and the light rays geometry in the hyperplane 2 = 0. 

We assume the space-time to be asymptotically flat so if one chooses r coordinate in 
such a way that H = then F and G will tend to 1 as f tends to infinity. 

Further, let some space-time area be filled up continuously by a family of observers 
uniformly rotating around the center r = of the field in accordance with the equations 



4> = ut + q, r = P{p). (2) 

Here co is some constant, p and q are the parameters uniquely identifying every individual 
observer (the usual identification mod 1% for the angle parameter q is assumed). The 
function P{j)) is to be determined from the equation 

' tr. 



.^yv) 

p = \ ^/G{^dr 



A choice of the integration constant is clearly immaterial here. 

The exchange of light signals by every pair of neighboring observers labeled by the 
parameters {p, q) and {p + dp, q + dq) respectively shows that the distance dl between them 
does not depend on a moment of measurement and is determined by the quadratic form 

dl^ = V + , dq\ (3) 

F{r) - LO^H{r) 

where r should be considered as the function of p. We use the system of units where the 
speed of light is equal to 1 and assume F{r)-uj'^H{r) > in the area under consideration 
(in the flat space the latter would merely mean that the orbital speed of observers does 
not exceed the speed of light). The worldlines (2) of observers are obviously the orbits of 
the Killing vector fleld that automatically implies the constancy of the distance (3) 'in the 
course of time'. 

We see therefore that the observers sense themselves to be immersed in the static 
Riemannian space with the metric (3). It is obviously the only representation of the 
ground level properties of metricity of the space available to them, the latter statement 
being equivalent to the claim that the speed of light measured in local experiments is the 
world constant even if the space-time is curved. 
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We shall name the Riemannian space endowed with the chart (p, q) and the metric 
(3) the physical space and the physical metric respectively. 

Now we remind that the null geodesic of the metric (1) is governed by the equations 



d(f) = e\ 
dt = e 



GF [1 - \^F/H) ^ 
GF-^ {1-\^F/H)~^ 



1/2 



nl/2 



where the positive branch of square root is assumed. A is the constant which is usually 
called the impact parameter of the ray. The sign of A determines whether the coordinate 
(f) locally increases or decreases along the (oriented) trajectory of photons. Symbol e = — 
for the initial half of the ray when the photons approach the center and e = + for the final 
half of ray when it tends to infinity. (We consider only the rays that are not seized by the 
gravitational field in the central region.) 

In accordance with the Eqs. (2) the rotating observers find the following relation 
governing the trajectory of the light rays (projections of oriented null space-time geodesies) 



n{v) = 0, where Q = dq - e\x{F/Hf^'^ - uj {H/Ff^'^] {H - X'^F)-^/'^dp, 
and V is the physical space vector tangent to the ray. 

We see that the direction of the light ray is described for rotating observers by the 
following vector of the physical space which is tangent to the ray, normalized to the unit 
and annihilates the form ft: 

V = V^X, e) = eT{X, lo)Xi + W{X, lo)X2 , 

where T(A,w) = {1 - Xlo)-\HF)-^/'' [{F-lo''H) [H-X^F]]'''' 
W{X,uj) = (l-Xuj)-^ \X{F/Hf^^ -uj{H/Ff^^' 



(4) 



Xi = dp, 

X2 = {F- Lo'^Hf/\FH)-^/'^dg. 

The pair of vectors {Xi , X2} is chosen to constitute the orthonormal frame in the physical 
space. 

The vector v is precisely the vectorial light speed as it is perceived by the rotating 
observer and whose direction coincides with the 'mechanical' one of the axis of a telescope 
directed to the source. The observed angle between the images of two sources on the 
celestial sphere is nothing else but the angle between the corresponding light speed vectors 
in the point of observation. Further, the angle between the vectors tangent to two light rays 
having the common point and attached to that point has to be determined with respect 
to the physical metric and the cosine of the angle equals their scalar product. Thus if the 







rays are characterized by the parameters (e',A' ) and (e",A") respectively the angle V" 
between them is determined by the equation 

exp(i 'u") = [e'T{X\Lo) + iW{\' ,Lo)][e''T{X'' ,Lo) - iW{X'' ,Lo)]. (5) 

This equation is the basic one yielding in particular the unique rigorous invariant 
interpretation of the notion of angle between two removed light sources as it is perceived 
by the chosen family of observers. Its sense becomes more transparent if one uses the 
dualization operator * defined in action on the orthonormal frame as follows: *Xi = 
X2, *X2 = —Xi; then exp(i 'u") =< Vi\v2 > + i < Vi \ *V2 > (< | > denotes the scalar 
product in the metric (3)). 

Now we assume that the space-time is vacimm outside some region around the center 
occupied by the static massive body, i.e. F = = 1 — 2ml r, H = r^ for r greater than 
some rmin > ^tn. The most of the reasoning below remains however valid in the case of 
arbitrary asymptotically flat space-time as well and one will easily distinguish the generic 
assertions. 

Let us consider the following observation scheme (see the Fig. 1). The observer which 
determines the positions of removed radiating sources (stars or quasars) on his celestial 
sphere uniformly rotates around the central mass along the orbit with some constant value 
of the coordinate r > Tmin- He can be included in a natural way into a family of uniformly 
and coherently rotating observers. We assume that the distances and angles are measured 
by observers in the manner described above. This is adequate in the frames of relativistic 
theory (up to the inevitable idealizing) to the corresponding practical means. 

Further, we assume for simplicity that there exist two light sources exactly in the 
plane of observer's orbit with the arc separation between them to be not very small. They 
are periodically eclipsed by the central massive body. We shall call one of them the master 
source and another the reference source for brief. 

One has the following natural definition of the light deflection angle. It equals the 
difference of the arc separations between the images of the master and reference sources 
measured in two cases, first, when the master source is close to the limb of the central 
body while the reference source is comparatively far from it and, second, when the both 
sources are removed from the limb (on the celestial sphere) as far as possible. 

Thus the observer needs carry out two observations. The flrst of them is timed to the 
moment close to the eclipse of the master source by the limb of the massive central body. 
We shall call it the principal observation P. 

The second, calibrating observation C is carried out in a typical situation approxi- 
mately a half of a revolution period after (or before) the principal one. Then the parts 
of the both null geodesies forming the images of the master and calibrating sources lie 
completely outside the observer's orbit and are removed in the space as far as possible 
from the center body. At the same time the images of the master and reference sources 
are maximally removed from the limb of central body on the celestial sphere. 

Let the observer register during the principal observation the following four light rays. 

The first is the ray jC from the master source passing near the central body and let it 
be characterized by the (unknown) A-parameter Aq > 0. It is obvious that the ray C has 
passed the point closest to the center when it is detected by observer {i.e. e = -|-). 








Figure 1. The sketchy picture of the relevant hght rays in the plane 
of the observer's motion is presented. The crossed inclined and 
horizontal lines denote the congruences of rays emanated by the 
calibrating and master sources respectively. Dotted circular line 
is the orbit of observer, the less circle around the center represents 
the section of surface of the central massive body. Other notations 
are explained in the text. The arrows attached to the points P 
and C represent the tangent vectors to the rays but it is important 
to remember that the depicted angles between them (denoted by 
directed arcs) have no physical meaning and do not coincide with 
measurable angles that, in particular, depend on u. 



The second one is the ray M from the reference source detected at the same moment 
of the principal observation. Let it correspond to the A-parameter A2 > 0. We assume 
for definiteness that the ray M has also passed the point closest to the center (although 
the opposite assumption is also admissible). This means mainly that the angle ( between 
the rays C and M, coinciding with the arc separation between the master source and the 
reference source on the celestial sphere during the principal observation, is somewhat less 
than the right angle at least. This restriction is in fact technical in nature but it covers a 
wide class of the physically meaningful conditions of measurements. 

We assume additionahy that the angle ( is not very small since otherwise the both 
disturbed and undisturbed angles between the sources images clearly vanish together with 
their difference. 

The third and the fourth rays A/""*" and J\f~ registered by the observer are emanated 
by the points of intersection of the observer's orbit plane and the boundary of the limb of 



the central body. The A-parameters A3 > of the ray Af~^ and —A3 < of J\f~ differs in 
sign (A3 is also unknown). The angle x between J\f~^ and Af~ yields the arc diameter of 
the limb. Another relevant measured angle is the arc separation k between the image of 
the master source and the point of the limb closest to it, i.e. the angle between the rays C 
and J\f'^. In fact, we shall not need know the angles k and x separately. As we shall see 
below, it is the angle a = k + x/2 between the master source and the center of the limb 
which really enter the relevant equations and crucially determines the purely gravitational 
deflection angle. 

All the e-parameters of the rays C, M.,J\f'^ ,Af~ equals + {i.e. the r coordinate in- 
creases along all the rays near the point P). 

The arc separation il> between the master and reference sources registered during 
the calibrating observation may be considered to be undisturbed or, to be more exact, 
minimally disturbed by the gravitational field comparatively to the other possible choices 
of the instant of observation. Hence the discrepancy 5 = tp — ( may be interpreted to be 
caused just by the gravitation deflection of the rays from the master source when they are 
passing through the region nearby the central body (and the influence of the observer's 
orbital motion as wefl). 

The angle S is constructed essentially from the values directly measured in experiment 
and it seems to be the most natural candidate to the role of the rigorous notion of the 
observable gravitational deflection angle. This definition of the defiection angle will be 
named local since all the measurements are carried out over the arbitrarily small parts of 
the light rays (formally, over the tangent vectors to rays). 

We assume for simplicity that the projections of geodesies forming the images of the 
master source are the same for the both observations, i.e. it is the ray C from the master 
source which is registered during the calibrating observation. In particular, it corresponds 
to the same A-parameter Aq > (meanwhile the e-parameter is opposite: e = — ). 

Our assumption fixes the instant C of the calibrating observation unambiguously. It 
seems also to be plausible from a geometrical point of view. Indeed, one may say that it 
enables one to compare the directions of the motion of a photon just before and after it 
has been undergone the action of the gravitational field in the central region (see Fig. 1), 
the 'standard' for comparison being the congruences of rays from the reference source. 

It seems worthwhile however to make the additional remark here. 

There is stfll some uncertainty in the understanding of what a position of the observer 
corresponds to the minimally disturbed position of the master and reference sources on 
the celestial sphere (certain distortion is of course inevitable provided the gravitational 
field exist; to be more exact, it is impossible to define what a position of the source might 
be understood as 'undisturbed' one and our occasional use of this word is not, strictly 
speaking, quite legal). In some extent this is a matter of convention in fact. This makes 
our above definition of the defiection angle not absolutely certain. Intuitively, our choice 
allows to describe the defiection of a single ray which is registered during the principal 
observation. Its precise formulation given above is rather convenient for calculations and 
has more or less clear interpretation but perhaps significantly less suits the realization in 
practice. Moreover, it has a disadvantage to become somewhat obscure when a approaches 
the right angle and fails when a exceeds it. 
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On the other hand one may argue, especially for comparatively large ip, that the 
positions of sources would be less disturbed provided the 'radial vector' * during the 
calibrating observation is the bisector of the angle between directions pointing the sources 
(see the Fig. 2). The 'bisectorial' choice of the instant of the calibrating measurement 
possesses also a pleasant feature to yield a maximum to the pure gravitational contribution 
(see Section 4 below) to the deflection angle, as a simple symmetry-based speculation 
shows. 




Figure 2. It is similar to Fig. 1 but the point C, that replaces 
C of Fig. 1 and corresponds to the instant of calibrating obser- 
vation, is chosen in such a way that the radial direction divides 
the angle constituting in C by the rays from the reference and 
master sources into two equal parts. 



Nevertheless we prefer here to make the definite choice mentioned above remember- 
ing that this point may require additional discussion. Besides, the 'bisectorial' choice of 
the moment of calibrating measurement is briefly analyzed in the Appendix where the 
corresponding version of the main resulting formula is derived. 

Now let us continue. It is clear from the Fig. 1 that the rays /C and M are 'almost 
parallel' and thus the A-parameter of the ray /C is negative provided the angle cr -|- ^ is not 
very small (we have assumed that ( at least is not very small). We designate however Ai 

* The notion of the 'radial direction' (pointing to the center of central body) is not quite 
obvious in om case due to effect of relativistic aberration. It can be described in a constriictive 
manner but will possess some unusual properties. To avoid undue complications, it is reasonable 
here to assume for a time that the rotation of observer is sufficiently slow to be neglected. 



the absolute value of the corresponding A-parameter and shall introduce the minus sign 
straight into the relevant formulae, e = — for the ray K, near C. 

Thus as a result of the two observations the following four angles become known: the 
disturbed {() a-^d undisturbed {^) arc distances between the sources, the arc diameter x of 
the central body and the angle k between the master source and the limb edge during the 
principal measurement. Our goal is to express the deflection angle 5 = -0 — C as a function 
of the parameters set in experiment (-0, x, k, r, u) and the space-time characteristics (such 
as the mass m of the central body). This can be achieved as follows. 

In accordance with the Eq. (4) one has the following expressions for the vectors 
tangent to the light rays: 

to the ray C in the point C c^o = V^(Ao, — ), 
K C cVi = V^{-Xu-), 

C P p^^o = ■K.(Ao,+), 

M P PV2=VUX2,+), 

Ar+ P +v, = V^{X,,+), 

Af- P -V, = VU-X3,+). 

Then the Eq. (5) implies the following relations between the four basic angles defined 
above and hght rays parameters: 

il> = arctan U{Xi,—uj) + arctan U{Xq , uj) , 
^ = arctan?7(A2,a;) — arctanC7(Ao,w), 

(6) 

X = arctan ?7( A3 , + arctan C/( A3 , — , 
K = arctan ?7(Ao, o;) — arctan C/( A3, a;). 

Here U{X,uj) = W(X,uj)/T{X,uj) 

= (XH-^^^F - u)H^I'')[{F - J'H){1 - X''F/H)]-^^\ 

The choice of the positive branch of the square root is assumed throughout. 

Eqs. (6) establish the local 1-to-l correspondence between the set of A-parameters of 
rays and the set of measured angles. 

The additional equation should express the fact of the intersection at a fixed r value 
of the two rays /C and A4 from the reference source parallel in asymptotic with the ray C 
from the master source. Indeed, one has the following dependencies of the coordinate (f) 
on r along the rays: 

on the ray C near the point C (/) = /(r, Aq), 

/C C (f> = 4>- I{r,Xi), 

L P (/. = J(r,Ao), 

M P (/> = + J(r,A2). 

Here is an unknown constant (equal to the asymptotic angle between the master and 
calibrating congruences) and 



/>oo 

I{r, A) = A / Zdr, Z 

J T 

J{r,X) = \( f + f )zdr 
2/(i?(A),A)-/(r,A). 
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(7) 



i?(A) denotes the maximal root of the equation H = A^F; it is assumed to be simple. 
The intersection of rays in P and C implies the equations 

/(r, \o) = 4>- I{r, Ai), J(r, Aq) = + J(r, A2). 



Ehminating 0, the equation follows 



/(r,Ao) + /(r,Ai) = J(f,Ao) - J(r, A2) 4» 
2[/(i?(Ao), Ao) - /(r, Ao)] = /(r, Ai) + 2/(i?(A2), A2) - /(r, A2). (8) 

Together with (6) one has the system of five equations. After the excluding of the four 
imknown A-parameters, the relation between the angles ^, -0, x, k can be established. If 
it is expressed in the form S = il> — ( = S{il>,x,i^) the problem may be considered to be 
resolved. 

It remains to reduce the solution to a form more suitable for applications in the typical 
situations. 

3. The deflection formula 

Let us notice that there are two small parameters in our formulae. The first of them 
is the ratio of the gravitational radius 2m of the central body to the A-parameter minimal 
among Aj (if all the rays escape the central body then, in the case of Sun-Earth system, 
for example, one will have m/Xj < 10~^). Another small parameter is the 'speed' of the 
observer's rotation (ujr). In principle, it may be of any value less than the unit but in 
the case of a free motion along a circular orbit in the Schwarzschild field one has precisely 
(urY = m/r (the generalized Kepler's law). Hence it is reason able t o assume that in any 
physically meaningful case (ur) cannot be much greater than y^m/r. At least, in the case 
of the Sun-Earth system one has as a typical value cor :^ lO""^. 

Clearly, it is convenient to apply the Taylor expansion with respect to the mentioned 
small parameters. Some care must be taken however to decide what the orders of terms 
are to be kept. 

To that end, let us notice that the recent light deflection observations for the Sun-Earth 
system by means of the VLBI method confirms the value 7 = 1 of the space curvature pa- 
rameter 7 of the PPN approximation [6,7] with the accuracy about 0.002 [4]. The deflection 
angle is proportional to 1 -|- 7 and, thus, permitting ourselves somewhat free interpreta- 
tion, we may assume that the deflection angle can be experimentally measured with the 
accuracy within 0.1% of its maximal value (1.7 arc seconds or 10~^ radians), i.e. 10~^ 
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radians (see also [12], pp. 5,155). One may neglect therefore any contributions significantly 
less than this measurement uncertainty e ~ 10~^ radians without prejudice to the result. 
Hence, the only small parameters powers that have to be kept are ur, (ujr)^ ,m/ Xj ~ m/r 
and, conditionally, {ujr){m / Xj) ^ muj. All the other terms will be dropped. 

We shall say that this approximation is of the order 3/2 (since the last kept term is 
typically estimated as mco ~ (m/r)^/^). 

The following expansion of the order 3/2 

arctan U{X, ±uj) ~ a =f (ojr) cos a — (m/r) tan a — jiojr)^ sin 2a =f (moj) sec a, 

where sin a = X,/r < a < 7r/2, and the Eqs. (6) yield the representations of the 
angles il>,(,x,n through the auxiliary unknowns aj = arcsin(Aj/r). The difference of the 
expansions of the two first equations (6) yields 

S = il> — ( c::^ ai + 2ao — 

+ (a;r)A+ [cos (•)] + {mlr)ISr [tan (•)] (9) 
+ \{ijor)^ISr [sin2(-)] + (m/r)(ra;)A+ [sec(-)] • 

Here the following functionals are introduced for convenience: 

A±[/(.)]^/(a2)±/(ai)-2/(ao) 

for every function j{a). The Eq. (9) determines the desirable defiection angle but its r.h.s. 
contains the unknown a's. 

We need now the additional equation (8) in order to exclude them. Let us fit it to 
the Schwarzschild case when F = = 1 — 2mlr, H = r^. After the substitutions, the 
corresponding indefinite version of the integrals /, J (7) takes the form 

A y |r [r^ - A^ (1 - 2m/r)] } ^ dr = X j [r {r - pi) {r - p^) {r - pz)]'^''^ dr, 
where pi = A(S - ^), p^ = 2A^, ps = -A(S + ^) 

and C = = 3-^/^ sin (3"! (arcsin [S^/^m/X])) , S = S(0 = (1 - H'^Y^^- (10) 

This expression determines precisely the first kind elliptic integral but it is more 
convenient for us to work on with its approximate representation for small ^ in terms of 
elementary functions. 

To that end, let us notice that if < ^ < l/(2-\/3) the maximal root i?(A) of the 
equation H = X^F equals pi and is simple. Passing to the new integration variable 
u = [r/X — S + ^]^/^ one obtains from (7) 

poo 

I{R{X),X) = Iom))= / n{u,Odu, 

Jo 



/(r,A) = /i(e(A),r/A) = n{u,C)du, 

where n = [(z.^ + S - ^) (z.^ + s - 3^) {u' + 2E)]-'^' , 

L = L{r/X,0 = {r/X-E + Cf/\ 

The latter representations have the advantage to be exphcitly smooth in the vicinity of 
the point ^ = and the Taylor expansion can be applied to them straight. Rirthermore, 
it is sufficient to keep only linear terms in the expansion. Indeed, in accordance with the 
definition of pi, 

X = R(X)/[(1-3C^Y^^ -C]> RiX) ^ m/Xj<m/R, 

provided < ^ < 0.5. On the other hand R{X) is the minimal value of the coordinate r 
along the light ray and it at least exceeds the radius of the central body provided the ray 
escapes it. [This seems to be too restrictive for the rays /C, M but one can infer from the 
Fig. 1 that it holds if ( and ip are not comparatively smah]. Hence we may assume m/R 
to be sufficiently small {m/R c:^ 10~^ for the Sun) to neglect all its powers except the first. 
This is true for all m/Xj (and m/r) as well. Such a practice of the linear approximating 
with respect to the gravitational radius 2m is in general use in the calculations of deflection 
angle during estimations of the (7)-like integrals. 

Under these conditions, one has in the first approximation 



^ ~ m/A < 1 
and /o(^)^/o(o) + ^4(0) 

^2^2+ 2} du + 2Cj {{l^^+l)'Vl^'+2] dv 
TTl 

= 7r/4 + ^ ~ 7r/4 + m/A = ^tt H coseca, 

Ii iC, r/X) - Ii (0, r/X) + idih (0, r/A) 



-(A/2r) ((r/Af - 1) 2 T [{v'' + if ^/'^^^] ' 



dv 



= \a-\- — cosec a{l — \ cos a — |- sec a) 

where Lq = (r/A — 1)^/^ and, as above, sin a = A/f, < « < 7r/2. 
Then the first order approximation to the Eq. (8) takes the form 

2q;o + Oil — Q!2 + (2m/r)[cosec ai + cosec (12] 

+ (m/r)[cotan a2 — cotan ai — 2cotan a^] (11) 
+ (2m/r)[cosec 2q;2 — cosec 2ai — 2cosec 2q;o] — 0- 



L6 

This approximation can be considered to be of order 3/2 as well since the neglected 
terms 0{{m/\Y) are of the second order. 

Eq. (11) together with Eq. (9) imply the following one: 

S ~ {Li)r)A'^ [cos (•)] 

+ {m/r){A~ [tan (•)] - A" [cotan (•)] 

- 2A- [cosec 2 (•)] - 2A [cosec (•)] } (12) 
+ l{corfA- [sin2(.)] 
+ (ma;)A+ [sec(-)] • 
Here we have introduced additionally the functional 

A[/(.)] =/(a2) + /(ai). 

The following remark concerning the Eq. (12) is useful. Its r.h.s. still contains the 
unknowns aj but now all such terms have the small coefficients (at least of the order 1/2). 
Hence it is sufficient to substitute the first order representation for aj in order to obtain 
the S value up to the order 3/2. 

The corresponding first order version of the Eqs. (6) is 

X — 2q;3 — (2m/r) tan as — ^i^^rY sin 2as, 
-0 ~ + ao + uJr{cos ai — cos ao) 

— (m/r)(tan a^ + tan ao) — ^(i^?')^(sin 2ai + sin 2ao), 

K aQ — as — a;r(cos ao ~ cos as) (13) 

— (m/r)(tan ao — tan as) — j{ior)^{sm 2ao — sin 2a3), 
a = K + x/2 ~ ao — a;r(cos ao — cos as) 

— (m/r ) tan ao — ^ (cor)^ sin 2ao . 

The Eqs. (11), (13) constitute a closed system with the following first order solution 



(14) 



ao 1 






ai 


hi 




a2 J 






as 


~ x/2 + {m 



Here 



i(tjr)^ sii 



Ao = cos cr — cos(x/2), Bo=tancr, 
Ai = cos x/2 — cos('0 — cr), Bi = tan('0 — cr), 
A2 = 2Ao + Ai = 2 cos a — cos x/2 — cos('0 — cr), 
B2 = 2 tan (7 + 2 tan('0 — cr) 

+ 2A[cosec(-)] + A"[cotan(-)] + 2A"[cosec2(-)], 
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and the functionals 

A-[/(.)]^/(^ + a)-/(^-a)-2/(a), 
A[/(.)] + + 

have been introduced for convenience. (The explicit expressions of Cj will not be used in 
what follows and are omitted here.) 

After a substitution for these representations of aj into the Eq. (12) one obtains 

S - (cor) A + {m/r)B + (corf C + (mw)D, (15) 



where A = A"'"[cos(-)], 

B = A-[tan(-)] - A-[cotan(-)] - 2A-[cosec2(-)] - 2A[cosec(-)], 
C = A-[sin2(-)] - A[sin(-),A], 
D = A+[sec(-)] - A+[sin(-),B] 
+ A" [sec^ (•), A] + A" [cosec^ 

+ 4A- [cos 2(-)cosec^2(-), A] + 2A[cos(-)cosec^(-), A]. 
Besides the A-functionals being defined above we have used here 

A+[/(-)] ^f{iP + a) + f{iP-a)-2f{a), 
A±[/(-), Y] = Y,f{i> + a)± Yi/(^ -a)- 2Yo/(a), 
A[/(.),Y] ^Y,f{i> + a) + Y,f{i>-a). 

Eq. (15) is the almost final result. It remains only to recast it to a decent form. After 
some transformations one can obtain the following rather compact representations for the 
first three coefficients in (15): 



A = -2(1 - cos'0)coscr, (16) 

B= 4cotancr ■ — , (17) 

sm ip + sm a 

C = —2(1 — cos'0)(sin'0cos^ a — sincrcoscr + sin cr cos x/2). (18a) 

The second formula turns out to be surprisingly simple comparatively with its initial 
form. 

As to the fourth coefficient D), it seems unhkely that it could be reduced to a more 
simple and at the same time useful form than as it stands. Fortimately. we need not such 
a representation in fact since only the unique leading term in the whole ID expression is 
valuable in a typical situation as we shall see below. 



it) 

4. The light deflection in the Sun-Earth system 

It is useful to remind here the orders of main parameters for the typical case of the 
observation from the Earth's orbit of the light deflection near the Sun. One has in this 
case 

2m ^ 2.5 km, uj ^ 10~'^ sec~\ r^^ 1.5 10^ km, x ~ 16' ?^ 0.005 rad. 
and our two basic small variables of expansions are of order 

ojr ~ 30 km/ sec ~ 1. lO""^ c (light speed), m/r ~ 1. 10~^. 

Hence 

(corY m/r ~ 1. 10~^, moj ~ (cor)^ ~ 1. 10~^^ 

[The first equality is also implied by the free character of the orbital motion of the Earth]. 
Further, as we have mentioned above, the basic level of the admissible uncertainty in our 
formulae is e 10~^ (radians). Let us analyze the Eq. (15) conformably to that case. 

Formulae (16), (17) need no further simplifications. 

The following obvious estimate holds for the third term of (15): 

< -{corYC < 4e. 

which thus less then 4 times exceeds the threshold of detect ability. In accordance with this 
estimate one may zero x fh^ C expression (18a) for any admissible tp. This may cause 
an error not more than ex^ in magnitude that one may neglect. Thus we may assume 

C —2(1 — cos'0)[cos^ crsin-^ + sin (7(1 — coscr)]. (18&) 

Further, the most interesting is the case when the pure gravitational contribution to the 
angle S is maximal. As we shall see below it occurs when a (restricted from below by x/2 fti 
0.0025) is minimal. In such a case of small cr ^ x one may use a more rough approximation 
which is yielded by the setting cr = in (186). Then the simplest representation of C 
follows: 

C ?^ -2(1 - cos'0)sin'0. (18c) 

Finally, let us revert to the Eq. (15) and consider its last term. At first, we must notice 
that the expression defining O contains the terms diverging when il> = a, the case quite 
admissible from the both physical and geometrical points of view. One can distinguish 
the following two apparently diverged aggregates involved in O: the first can be extracted 
from the last three terms of the expression defining O and is equal to 

— 4Ai cos 2{'ip — cr)cosec^2('0 — cr) — Aicosec^('0 — a) + 2Ai cos('0 — cr)cosec^('0 — cr), 

and the second is the fragment of B2 equal to 

— cotan('0 — cr) — 2cosec 2{'ip — cr) + 2cosec {ip — a). 



They however can be recast to 

Ai(2cos('0 - cr) + 1) sin^ ^{ip - a)[cos{ip - a) cos - a)]~^ 
and — 2sin^ ^(ip — a)[cos{ip — a) cos ^{ip — cr)]~^ 

respectively and therefore do not really diverge; moreover, they both vanish provided 
il> = a. 

This seeming singularity is clearly caused by the chosen way of the approximating 
of the basic equations only. Indeed, if ip = a the leading term of Ai expansion vanishes 
and the assumption m/Ai <C 1 which we have used when estimated the integrals fails. 
The complete result remains however unaffected and no special care on that point seems 
to be necessary. Notice that a similar cancellation of the seeming singularities has been 
encountered in the reducing of the expression defining B as well. 

Since the coefficient in the last term of (15) (mu) :^ 10~^e is very small it is sufficient 
to keep only those parts of the whole EI) expression that significantly exceed the unit. Such 
terms exist only if cr -C 1. The B2 aggregate contains the sum 

— 2cotan a — 4cosec 2a = — 2(cos a + sec cr)cosec cr ?^ — 4/ sin cr. 
Besides O contains itself the expression 

— 8A0 cos 2cr cosec^ 2cr — 2Ao cosec^ cr, 

but it is equal to 

2[1 - y^^ ][2 -tan^ cr] (cos (7 + cos 
sin cr 

and does not considerably increases as a decreases (we would remind that cr > x/2). Thus 
the maximal contribution to O is due to the expression — 4sin('0 + cr)/ sin cr and one has 

THi . sin -0 

WJ ^ -4 -, 

sin cr 

provided cr -C 1. 

We see now that under the typical conditions even in the most favorable case 

|(mtj)D| < O.le 

and thus the contribution due to the last term in (15) is at least an order less than the 
current measurement uncertainty. 

The formulae (15)-(19) yield the final expressions for the locally measurable deflection 
angle. In the case of free orbital motion of observer one has additionally u)r = ^/m/r. 

5. Discussion 

Let us briefly discuss the separate contributions to the deflection angle (15). The sum 



(19) 



5,b = K)A + K)'C (20) 

does not depend on the mass m of central body and is determined mainly by the orbital 
speed {u)r) of the observer. It can be clearly classified as to describe the effect of the 
relativistic aberration calculated up to the second order of the speed (the third order 
contribution could be showed to be negligible). One may name it kinematic deflection 
angle as well. The following estimate holds for it in the case of the Sun-Earth system 

< -5ab < 2 10^ e 

The aberration contribution dominates in absolute value. However it is not involved as 
a rule in the theoretical analysis of the gravitational hght deflection, perhaps, as a side effect 
existing even without any gravitational field. The origin of the aberrational contribution to 
the deflection angle is the different mutual orientations of the registered rays and the orbital 
speed of observer during the principal and the calibrating measurements respectively. Thus 
this part of the deflection angle is not really related to any light rays deflection and our 
terminology is perhaps not very adequate in this point. Anyway, the aberration dependent 
angle (20) contributes the experimentally measured difference of the 'undisturbed' and 
'disturbed' arc separations between the master and reference sources and that is why it is 
kept in the deflection formula (15). 

It is worthwhile to mention here that due to the Eq. (18a) the aberration contribution 
to the deflection angle formally depends on the arc diameter x of fhe central body. This 
seems to be rather strange but really means only that if the measurements are carried out 
several times with different x values but constant all the other conditions of experiment the 
corresponding values of a will slightly differ precisely in such a way to ensure the deflection 
angle (15) to remain unchanged. One may say that the direction to the center (described 
by a) will depend on x but there is no contradictions in such a statement because the 
'irrelevant' parameter ip is not involved in this dependence. 

The last term of (15) 

5g_k = (ma;)D (21) 

is of a mixed gravitational-kinematic nature. For the Sun-Earth system its value does 
not exceed O.le and hence its detection should require at least 10-fold improvement of 
the measurement accuracy. At the recent level of the accuracy of experiments it can be 
temporarily omitted. 

The most important result implied by the formula (15) is the expression for the most 
interesting pure gravitational contribution to the deflection angle which would coincide 
with the total deflection 8 if there were no rotation of observer (w = 0) or, to be more 
exact, if his rotation were sufficiently slow: 

2m, , sini/) 

S„ = (2 cotan a) ■ — . (22) 

r sm ip + sm a 

It is useful to compare it with the classical expression (see [3], Eq. (AlO), or [6], Eq. 
(40.11)) 



4t = — (cotan-J (23) 

The main formal difference with (22) is the manner of entrance of the factor 2: under (in 
(23)) or outside (in (22)) of the cotan symbol The 'compromise' between these two for- 
mulae is thus impossible as far as the function tan deflects from the linear proportionality. 

To be more exact, the latter formula pretends to determine the absolute deflection 
angle, meanwhile we are operating with more physically meaningful relative one. Hence 
its differential version 

4t = 4t((7) - Sst{a- + C) 

= 4t(cr) - 4t(cr + i>) + 0{mu) 

~ [cotan— — cotan — - — J ^ ^ 

2m . , 2 sin -0 

= (cotan G + cosec o) — 



sin -0 + sin('0 + cr) + sin o 



is the more appropriate object for comparison. 

The formulae (22) and (24) are clearly dilferent though rather like. The leading terms 
of the expansion of their discrepancy with respect to a (assuming it to be small) are 
estimated as foflows: 

2?77; lb (T lb 

S,t - Sgr = —[tan |- - 2 ^^^^ f + ~ 2etan(^/2). (25) 

and do not vanish even a is arbitrary small. Calculations with the complete formulae 
(22), (24) for ip = 60° confirm that 4+, — "^gr is almost constant and slowly decreases from 
0.29(4m/7') to 0.27(4m/7') (we would remind that accidentally m/r ~ e) as cr increases 
from 0.003 radians (source close to the edge of the limb) to 0.5-0. Thus it is of order 
of the measurement uncertainty under the typical conditions. It is also interesting to 
estimate the similar difli"erence when the second order aberration term {ujrY C and the 
cross gravitational-kinematic term {muj)J} are taken into accoimt as well {i.e. the complete 
deflection angle S without the first order aberration term is used). The calculation shows 
that it increases approximately to 0.5 (Am/r). Thus the difference — ^gr is close to the 
quadratic aberration contribution. 

We shall not discuss here in the fufl details the origin of the defect of the classical 
formula (23) which is manifested in its disagreement with Eq. (22). We only remark that, 
as to our opinion, it has arisen due to the implicit attaching of too great importance to 
the Euclidean geometry of the sheet of paper where the theorists depicted the light rays 
and observer's trajectories. 

More correctly, the connection between the space-time geometry and the results of 
observations has been built in the way inadequate to the effect under consideration. It 
seems nonsense to seek the mistakes in the (rather transparent) calculations in a num- 
ber of wellknown textbooks and monographs, including refs. [6-13], analyzing the light 



deflection in the Schwarzschild geometry: these calculations are certainly correct. The dis- 
crepancy arises before the calculations are initiated. They are contained in the principles 
of description of the deflection angles measurement. 

There is a lot of expositions of the calculations leading to the formula (23), but 
the analysis manifests the following their common feature: the interpretation in physical 
terms of the geometrical objects dealing with the inevitably Zoca/ measurements of the light 
deflection involves some non/ocal facflities. As a rule the Minkowski space geometry is used 
although it is not sometimes explicitly declared. PPN method, which is used, for example, 
in [6,7,12-15], is directly based on the auxihary Minkowski space. On the other hand in 
the book [8] the deflection angle is simply identified with the total variation mod tt of the 
coordinate (p along a null geodesic. This result can be equipped by an invariant geometrical 
meaning of course (this has been done in [5], for example) but it is clear that such an angle 
can be 'measured' only in terms of the geometry of the auxiliary flat space-time and is 
in no way a local quantity. [The nonlocal operation here is the parallel transport of the 
vector tangent to the ray from the beginning to the end of the ray in order to compare it 
with the vector tangent to the ray therein. The space through which transport is carried 
out must be flat, otherwise the result would depend on the path of transport.] Similar 
remark can be done with respect to refs. [9-11,16] as well. 

Another typical example can be found in [17]. The straight lines constituting the 
angle (po in the FIGURE 1 (p. 69) do not exist in the curved space-time and one is forced 
to introduce the auxiliary Minkowski space in order to attach the geometrical meaning to 
the formal calculations therein. It cannot be realized in a unique way however. 

It is worthwhile to note that similar pictures can be found in almost every work and 
they {i.e. the corresponding underlying flat space geometry) are really used in calculations. 

And as the flnal example, in the pioneering paper [2] the working tool is the system 
of isotropic coordinates in the Schwarzschild space that are in fact interpreted as the polar 
coordinates in some flat space which also is not really manifested as the arena of a physical 
measurement. 

We have therefore two weak points lying in the foundation of the Eq. (23): 

(i) The nature of the geometrical tools used for its derivation is nonlocal contrary to the 
local nature of the observation that at least require a separate study, and 

(ii) the Minkowski space cannot be uniquely distinguished in the curved space-time even 
if the curvature of space-time is small. 

The latter fact has been mentioned in [18] but it has to be connected with inadequate 
approach based on "perturbations around flat space-time" rather than with the physical 
essence of the problem. 

On the contrary, our approach does not reveal any such sort ambiguity. It is based in 
fact on the metric (3) alone, that simply realizes the principle of the constancy of the local 
light speed. In these frames, the introduction of auxiliary Minkowski space and further 
development of a perturbation scheme are unnecessary (and inadequate) complications in 
fact *. 

* This is a reason why it seems to be not very fruitftil to perform a detailed comparison of our 
aberration formula (Eqs. (16), (18), (20)) with its standard counterpart which can be found, for 
example, in [12], p 64, because the latter one is also based on the PPN approach. 



Some uncertainty that exists due to possibility of different definitions of what moment 
of time is most reasonable for calibrating measurement (see Section 2) is connected with the 
physical essence of problem and can be removed, for example, by a reasonable convention. 

Additionally to the Eq. (22), the gravitational deflection angle for 'bisectorial' choice 
of the moment of calibrating measurement is calculated in the Appendix (Eq. (A22)). 
It also does not coincide with the standard formula (24), the discrepancy being of order 
e(4tan|'0), i.e. similar to that in Eq. (25). 

Finally, it must be emphasized that the validity of our approach is in no way related 
to the above speculations concerning with the other more usual approaches and must be 
estimated independently on their estimate. 

6. Resume 

In this work we have found the basic equations determining the value of the angle of 
light deflection as it is registered by the observer uniformly rotating in a static spherical 
symmetric gravitational held along a circular orbit (Eqs. (6-8)). Their derivation is per- 
formed in frames of the standard general relativity and is based exclusively on the principle 
of the constancy of the local speed of light. In particular it does not involve any artificially 
constructed 'frames of reference', Fermi- Walker transport, PPN-approximations or similar 
tools. 

The result has been further fitted to the case of measurement in the Earth-Sun system 
from the Earth's orbit and the formulae sufficiently simple for applications have been 
obtained (Eqs. (15-18)). 

The whole deflection angle is divided into the several contributions: the contribution 
of the relativistic aberration (20) (kinematic deflection of the flrst and second orders), the 
pure gravitational deflection (22) and the mixed gravitational-kinematic deflection (21). 

Our calculations prove that the classical formula 8,^t = (2m/r)cotan((T/2) does not 
determine the gravitational deflection comprehensively even if it is adopted to a really 
used differential measurement scheme and the rotation of observer can be neglected. 

The necessary correction to the gravitational deflection is expected to be of order e 
where e ~ 10~^ (radians) is the level of the uncertainty for the recent radio- waves deflection 
measurements by means of the VLBI method. Thus one may hope that the this discrepancy 
can be detected by means of the up-to-day or perhaps somewhat improved experimental 
technic. 

This task seems to put forward rather complicated problems however. At flrst, there 
is a number of masking effects affecting the observed deflection angle that we have not 
touched on in this paper at all (for example, the gravitational fleld and the own rotation of 
the Earth, the nonzero eccentricity of the Earth's orbit, the refraction of the atmosphere 
of the Earth and the solar corona etc., see also [4]). They are to be taken in account 
independently if one intends to analyze the real experimental data. The crucial point, 
however, seems to be a necessity of a sufficiently accurate estimation of the angle cr, i.e. 
the observed direction to the center of the Sim in fact. The VLBI method seems to be not 
very useful for this purpose and perhaps some new ideas should be drawn in. 

Finally it is worthwhile to notice that our requirement for the both observed sources 
to belong to the ecliptic could be easily removed resulting some geometry complication 
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only. This should somewhat improve the model plausibility. 
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Appendix 

Here we calculate the hght deflection angle in the case of alternative choice of the 
moment of 'minimally disturbed' positions of the master and reference sources on the 
celestial sphere which has been mentioned in the Section 2. Now we shall assume that the 
bisector of the angle between the sources is precisely of the outgoing radial direction. The 
scheme of observations is depicted in the Fig. 2. Its only difference with the Fig. 1 is the 
position of the point of calibrating observation which will be now denoted C It does not 
lie generally speaking on the ray £ but rather on another ray £' from the master source. 
Obviously the position of C is uniquely determined by the congruences of rays from the 
sources and does not depend on the choice of the moment of principal observation P. 



The calculations of the light deflection angle in the case C reproduce mutatis mutandis 
ones in the Sections 2 and 3. 

Geometrically, the point C is replaced by the point C, the geodesic which corresponds 
to the ray £' being characterized by the A-parameter Ai instead of Ao for C 

Analyticafly, the first equation of the set (6) is replaced by the equation 

ip = arctan t/( Ai , —oj) + arctan C/(Ai , oj) (AG) 

meanwhile other equations (6) stifl hold. (We use the notations borrowed from the Sections 
2,3.) 

Instead of Eq. (8) one has now 

2/(r, Ai) + /(r, Ao) - /(r. As) + 2[-/(i?(Ao), Aq) + I{R{X2), M)] = {A8) 
that can be reduced in the first approximation with respect to m/r to 

2ai + do — H { 2(2cosecQ;i — cosecao + cosecas) 

— 2(2cosec2Q;i + cosec2Q;o — cosec2Q!2) 
— (2cotan ai + cotan ao — cotan as ) } — 

All the other relevant equations will follow from ones mentioned above. 

Here however we shall be interested only in the pure gravitational contribution to the 
deflection angle and hence assume the rotation of observer to be sufficiently slow to neglect 
it everywhere. Additionally, this enable us to avoid the necessity of the special definition 
of the 'radial direction'. We omit therefore all the terms involving the rotation frequency 
parameter u. 

The defiection angle is determined by the equation 

771 

^max = i' — C — 2cki + ao — a2 (2 tan ai + tan ao ~ fa-n (^2), (^9) 

which in view of (^11) takes the form 

^max = — {— 2(2cosecai — cosecao +coseca2) 
r 

— (2 tan ai + tan ao — tan as) 
2(2cosec2ai +cosec2ao — cosec2a2) 
+(2cotanai + cotan ao — cotan as)}. 

It remains to express aj in terms of angles a,i>, i.e. to derive the counterpart of Eqs. 
(14). It is now sufficient however to derive their zero order approximation. In the same 
way as in the Section 3 one easily obtains 

ao ^cr, 

ai -i^, (AU) 
as ^cr + ip 



(AU) 



{A12) 
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up to the terms 0{mjr) (that would yield 0{{m/rY) contribution to ^max which is ne- 
glected). 

After the substitution for (^14) to (^12) and some transformations one obtains 

^max ^ 4t - ^ tan 1-0 

1 2 tan jil) 9 i . , „ x / .^^x 

= — cosec|cr r— ^— -cos jfi/) + 2cr) (^22) 

r 2 sin|('0 + (7) 4vr y 

that is the desirable result. 



